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^^ Abstract 

We present an arbitrage-free non-parametric yield curve prediction model which takes 
the full (discretized) yield curve as state variable. We believe that absence of arbitrage is 
^5 an important model feature in case of highly correlated data, as it is the case for interest 

^N rates. Furthermore, the model structure allows to separate clearly the tasks of estimating 

,— I the volatility structure and of calibrating market prices of risk. The empirical part includes 

Ph tests on modeling assumptions, back testing and a comparison with the Vasicek short rate 

model. 



qh 1 Zero coupon bond prices and yield curves 



Insurance cash flows are valued using the risk-free yield curve. First, today's yield curve needs 
to be estimated from government bonds, swap rates and corporate bonds and, second, future 
yield curves then need to be predicted. This prediction is a complex task because, in general, 
^vj it involves the forecast of infinite dimensional random vectors and/or random functions. In the 

(^ present paper we tackle the problem of yield curve prediction using a non-parametric approach, 

^ which is based on ideas presented in Ortega et al. [^. In contrast to [7| we are heading for long 

T— I term predictions as needed in insurance industry. Assume t > denotes time in years. Choose 

J> T >t and denote, at time t, the price of the (default-free) zero coupon bond (ZCB) that pays 

^ one unit of currency at maturity date T by P(t, T). The yield curve at time t for maturity dates 

5^ T > t is then given by the continuously-compounded spot rate defined by 

>^(t,r) = -^ iogP(t,r). 

Aim and scope. 

Model stochastically the yield curves T i— )• Y{t,T) for future dates t € (0, T) such that: 

(i) the model is free of arbitrage; 

(ii) explains past yield curve observations; 

(iii) allows to predict the future yield curve development. 

In contrast to standard literature on prediction of yield curves we insist that models should 

be free of arbitrage. This requirement is crucial when it comes to the prediction of highly 
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correlated prices as it is the case for interest rates. Otherwise it is possible to "artificially" shift 
P&L distributions. More precisely, if a prediction model admits arbitrage then implementing 
this arbitrage portfolio yields an always positive P&L. In practice adding such an arbitrage 
portfolio can then be used to shift P&L distributions of general portfolios, which is an undesired 



effect from the point of view of valuation and risk management, see Figure 21 and Section 5.4 



Organization of the paper. The remainder of the paper is organized as follows: in Section 
2 we propose our discrete time model for (discretized) yield curve evolution. In Section 3 we 
describe the ubiquitous no arbitrage conditions for our modeling setup. In Section 4 we describe 
the actual calibration procedure and in Section 5 we present a concrete calibration to real market 
data. 

2 Model proposal on a discrete time grid 

Choose a fixed grid size A = 1/n for n G N. We consider the discrete time points t G ANq = 
{0, A, 2A, 3A, . . .} and the maturity dates T E t+AN. For example, the choice n = 1 corresponds 
to a yearly grid, n = 4 to a quarterly grid, n = 12 to a monthly grid, n = 52 to a weekly grid 
and n = 250 to a business days grid. 

The filtered probability space is denoted by (fi,.?-', P, F) with real world probability measure P 
and (discrete time) filtration F = {Tt)teANo- 

We assume that the ZCBs exist at all time points t G ANq for all maturity dates T = t + m 
with times to maturity m G AN. Thus, we can consider the discrete time yield curves 

Yt = (^(i,i + "l))^gAN 

for all time points t G ANq. Assume that (Yi)igANo i^ F-adapted, that is, {Ys)s<t is observable 
at time t and this information is contained in the a-field J-t- Our aim is (as described above) 
to model and predict (Yt)t&/mo- ^^ assume that there exists an equivalent martingale measure 
p* ~ p for the bank account numeraire discount {B^ )ieANo and, in a first step, we describe 
(Y()tgANo directly under this equivalent martingale measure P*. Notice here that the bank 
account numeraire is actually a discrete time roll-over portfolio, as will be seen in the next 
section. 

Remark. The assumption that the yield curve is given at any moment t G ANq for sufficiently 
many maturities is a very strong one. In practice the yield curve is inter- and extrapolated 
every day from quite different traded quantities like coupon bearing bonds, swap rates, etc. 
This inter- and extrapolation allows for a lot of freedom, often parametric families are used, 
e.g. the Nelson-Siegel [6] or the Svensson [HI [9] family, but also non-parametric approaches such 
as splines are applied (see Filipovic [3]). 



3 Stochastic yield curve modeling and no-arbitrage 

Assume the initial yield curve Yq = {Y{0,m))meAN at time t = is given. For t,m £ AN we 
make the following model assumptions: assume there exist deterministic functions a/\{-, ■, •) and 
va("; ■) ■) such that the yield curve has the following stochastic representation 

mY{t,t + m) = {m + A)Y{t- A,t + m) - AY{t- A,t) (3.1) 

+ aA{t, m, (Ys)s<t-A) + VA(t, m, {Ys)s<t-A) £*t, 

where the innovations £* are J^i-measurable and independent of J-t-A under P*. In general, 



the innovations e^ are multivariate random vectors and the last product in (3.1) needs to be 
understood in the inner product sense. 



Remark. The first two terms on the right-hand side of (3.1) will exactly correspond to the no- 



arbitrage condition in a deterministic interest rate model (see (2.2) in Filipovic [3]). The fourth 



term on the right-hand side of (3.1) described by V/\{t,m,(Ys)s<t-A)s:t adds the stochastic 
part to the future yield curve development. Finally, the third term aA{t,m, (ys)s<t-A) will be 
recognized as a Heath-Jarrow-Morton [4J (HJM) term that makes the stochastic model free of 



arbitrage. This term is going to be analyzed in detail in Lemma 3.1 below. This approach allows 
us to separate conceptually the task of estimating volatilities, i.e. estimating va, and estimating 
the market price of risk, i.e. the difference of P and P*. 



Assumption (3.1) implies for the price of the ZCB at time t with time to maturity m 



P(t — A t + m) 
P{t, t + m) = — , ' - — exp {-aA(t, m, {Ys)s<t-A) - VA(i, m, (Ys)s<t-A) £* } ■ 

In order to determine the HJM term dAit^m^^Y s)s<t-A) we define the discrete time bank 
account value for an initial investment of 1 as follows: Bq = 1 and for t G AN 

t/A-l ( t/A~l ~\ 

Bt= W P(As,A(s + l))-i = expJ A ^ y(As, A(s + 1)) i > 0. 

s=0 [ s=0 J 

The process B = {Bt)t<^Ai% considers the roll over of an initial investment 1 into the (discrete 
time) bank account with grid size A. Note that B is previsible, i.e. Bt is J"t_A-ineasurable for 
all t G AN. 

Absence of arbitrage is now expressed in terms of the following (P*,F)-martingale property 
(under the assumption that all the conditional expectations exist). We require for all t,m £ AN 

E*[B^^ P{t,t + m)\Tt^A] = B^\P{t-A,t + m). (3.2) 

The necessity of such a martingale property is due to the fundamental theorem of asset pric- 
ing (FTAP) derived in Delbaen-Schachermayer [2]. For notational convenience we set Ej [•] = 



E*[-|J7] for t G ANq. The no-arbitrage condition (3.2) immediately provides the following 
lemma. 



Lemma 3.1 Under the above assumptions the absence of arbitrage condition (3.2) implies 

QA(i,m, {Ys)s<t-A) = log EJ'^^ [exp{-VA(t,r?T,, (Ys)s<t-A) s^}] . 
This solves item (i) of the aim and scope list. 



Proof of Lemma|3.1[ We rewrite ( 3.2 1 as follows (where we use assumption ( 3.1 1 of the yield curve development 



and the appropriate measurability properties) 

exp {-A Y(t - A, t)} Et_A [P{t, t + m)] = P{t - A, t) Ej.a [P(t, t + m)] 

= P{t- A,t + m) exp{-aA(t,m, (Ys)3<t-A)}lEt_A [exp{-VA(t,m, (Ys)3<t-A) £*t}] 
= P{t- A, t + m). 

Solving this requirement proves the claim of Lemma |3.1[ 



4 Modeling aspects and calibration 

We need to discuss the choices ^ Ait, rn,{Ys)s<t- a) and e^ as well as the description of the 
equivalent martingale measure P* ~ P. Then, the model and the prediction is fully specified 



through Lemma 3.1 



4.1 Data and explicit model choice 

Assume we would like to study a finite set M. C AN of times to maturity. We specify below 
necessary properties of Ai for yield curve prediction. For these times to maturity choices we 
define for t £ AN 



■t,+ 



(y(t,t + m))'„g^ and Yt,- = {Y{t - A,t + m))'^^^ , 



that is, in contrast to Yj the random vectors Yt^-\- and Yt- only consider the times to maturity 
m and m + A for m £ Ai. Note that Y^ _ is J-t„A-measurable and Yt.-\- is J^(-measurable. Our 



aim is to model the change from Yt^_ to Yt,+. In view of (3.1) we define the vector 



Tt = iTt,m)UM = imYit,t + m)-{m + A)Y{t-A,t + m))l^^. 

We set the dimension d = \M\. For St\j^t_^ we then choose a d-dimensional standard Gaussian 
distribution with independent components under the equivalent martingale measure P*. 

Remark. We are aware that the choice of multivariate Gaussian innovations s* is only a 
first step towards more realistic innovation processes. However, we believe that already in this 
model, with suitably chosen estimations of the instantaneous covariance structure, the results 
are quite convincing - additionally chosen jump structures might even improve the situation. 
The independence assumption with respect to the martingale measure is an additional strong 
assumption which could be weakened. 



Thus, we re-scale the volatihty term with the grid size A and assume that at time t it only 
depends on the last observation Yt^_: define va(-, •, •) by 

VA(t, m, {Ys)s<t~A) = VA a{t, m, Yj _), 



where the function <t(-,-,-) does not depend on the grid size A. Lemma 3.1 implies for these 
choices for the HJM term 



aA(t,m, (Ys)s<i_A) = log IE(*_A exp j-VA <T(t,m, Y* _) e^j 



^ \\ait,m,Yt,-)\\\ 



From (3.1 ) we then obtain for t G AN and ?7i G 7W under P* 



A 



-Yit-A,t) + ^\\ait,m,Yt,-)f 



+ VAcr(t,m,Yi,_) e*. 



(4.1) 



Note that {Tt)teAN is a d-dimensional process, thus, we need a d-dimensional Gaussian random 
vector Bt\j^t-A fo^ obtaining full rank and no singularities. Next, we specify explicitly the d- 
dimensional function cr(-,-,-). We proceed similar to Ortega et al. [7j, i.e. we directly model 
volatilities and return directions. Assume that for every y G M'^ there exists an invertible and 
linear map 

?(y):M'^^M^ A ^ ?(y)(A). (4.2) 

In the sequel we identify the linear map ?(y)(-) with the corresponding (invertible) matrix 
<;"(y) G M which generates this linear map, i.e. <f(y)(A) = <;"(y) A. In the next step, we choose 
vectors Ai, . . . , A^^ G M*^ and define the matrix A = [Ai, . . . , A^] G M.'^^'^. Moreover, for y G M*^ 
we set 



SA(y) = c,(y) A A' c,'(y) G 



ixd 



Using vector form we make the following model specification for (4.1): 



Model Assumptions 4.1 We choose the following model for the yield curve at time t G AN 
with time to maturity dates M: 



A 



-Y(t-A,t) + ^sp(SA(Yt,_)) 



+ VA?(Yi,_) Ae*, 



with Y(t — A,t) = {Y{t — A,t), . . . ,Y{t — A,t))' G W^ and sp(Sa) denotes the d- dimensional 
vector that contains the diagonal elements of the matrix Sa G M . 

For the j'-th maturity rrij G A^ we have done the following choice 



cr(t,mj, Yi,_) el = ^ cr,(i, m^, Yi,_) 6^*^ = ^ [?(Yi,_) \i]. el^ 



i=l 



i=l 



The linear map ?(•) describes the volatility scaling factors, Ai, . . . , A^ G M"^ specify the return 
directions, and the volatility choice does not depend on the grid size A. Our aim is to calibrate 
these terms. 



Remark. The volatility scaling factors <;(•) mimic how volatility for different maturities scales 
with the level of yield at this maturity. Several approaches have been discussed in the literature. 
The choice of a square-root dependence seems to be quite robust over different maturities and 
interest rate regimes, but for small rates - as we face it for the Swiss currency CHF - linear 



dependence seems to be a good choice, too, see choice (4.7) 



Lemma 4.2 Under Model Assumptions 4-1 • the random vector Ti|jr^_^ has a d-dimensional 



conditional Gaussian distribution with the first two conditional moments given by 



Cov:_^ (TO 



Y(t-A,t) + ^sp(SA(Yi,_)) 



A 

A SA(Yi,_ 



4.2 Calibration procedure 

In order to calibrate our model we need to choose the volatility scaling factors ?(•) and we 
need to specify the return directions Ai,...,Arf G M.'^ which provide the matrix A. In fact 
we do not need to specify the direction Ai, . . . , A^ G M'^ themselves, but rather Sa, which we 
shall do in the sequel. Assume we have observations {yt)t=A,...,AKi 0^{t — A,t))(=^^ ^(x+i); 
and 0^t-)t=A,...,A{K+i)- We use these observations to predict/approximate the random vector 
'^A(_ft'+i) at time AK. For y G R"^ we define the matrices 






-^([.(Ya., 



■ Afc 



?(y) c^K) c[x) ?'(y) e 



tidxK 



idxd 



k=l,...,K 



Choose t = A.{K + 1). Note that C(x) is -Ft_A-nreasurable. For x, y G 
d-dimensional random vector 



i we define the 



Kt = Kt(x,y) 



1 



A X + - sp (%)(y)) + ?(y) qx) W?, 



(4.3) 



with W* is independent of J-t-A, ^f-measurable, independent of e^ and a X- dimensional stan- 
dard Gaussian random vector with independent components under P*. 



Lemma 4.3 The random vector K,t\j^^_^ 
first two conditional moments given by 

n-Ai^^t] = 

C0Vj_^ {Kt) = 



has a d-dimensional Gaussian distribution with the 



1 



A x+- sp(5'(i^)(y)) 



= S{K){y)- 
Our aim is to show that the matrix 5'(x)(y) is an appropriate estimator for ASA(y) and then 



Lemmas 4.2 and 4.3 say that Kt is an appropriate stochastic approximation to T^, conditionally 
given J"t_A- 



Remark. The random vector Kt can be seen as a filtered historical simulation where W* re- 
simulates the K observations which are appropriately historically scaled through the matrix 



C, 



(K)- 



We calculate the expected value of S(^x\{y) under P*. Choose z,y G M"' and define the function 



/A(z,y) = ?(y)-^ 



-z + -sp(SA(y)) 



-z+-sp(SA(y)) 



i^y)-')'- 



Note that this function does not depend on the grid size A. Lemma 4.2 then implies that 

fA{Y{t-A,t),Yt,-) = A-2,(Yi,„)-iEU[Ti] E^ W (?(Yi,_)-i)' , (4.4) 

where the left-hand side only depends on A through the fact that the yield curve Y(_a is 
observed at time t — A, however otherwise it does not depend on A (as a scaling factor). 



Theorem 4.4 Under Model Assumptions 4-1 we obtain for all K ^ N and y G 



K [V)(y)] =^ SA(y) + A2 ,(y) flf]E*[/A(Y(A(A;- l),AA;),YAfc,-)]j ?(y)'. 
Interpretation. Using Stx)iy) ^s estimator for ASA(y) provides, under Pq, a bias given by 

^' -^(y) (^ E^S [/a(Y(A(A; - 1), Afe), YAfc,-)] j ^(y)'. 

If we choose t = AK fixed and assume that the term in the bracket is uniformly bounded for 
A — )• then we see that 



K [%)(y)] = A SA(y) + A2 0(1), for A ^ 0. 

That is, for small grid size A the second term should become negligible. 



(4.5) 



The only term that still needs to be chosen is the invertible and linear map '^^(y), i.e. the volatility 
scaling factors. For -i? > we define that function 



h: 



i+, y^h{y) = i!}-'/^yl{y<^}+y'^H 



{y>-&}- 



(4.6) 



As already remarked in Subsection 4.1 in the literature one often finds the square-root scaling, 
however for small rates a linear scaling can also be appropriate. For the Swiss currency CHF it 
turns out below that the linear scaling is appropriate for a threshold of ■(? = 2.5%. In addition, 
we define the function h{-) as above to guarantee that the processes do not explode for large 
volatilities and small grid sizes. 
Assume that there exist constants aj > 0, then we set for y = (yi, . . . , ya) G M°' 



?(y) = diag(cri/i(yi),...,crrf/i(yrf)) = diag((Ti, . . . ,0-^) diag(/i(yi), . . . , /i(yd)). 



Basically, volatility is scaled according to the actual observation y. This choice implies 
1 



?(y) C{K) 



K 



?(y) [?(YAfc,-)-' TAfe] 



j=l,...,d; k=l,...,K 



—= diag{h{yi),...,h{yd)) ( [diag(/i(YAfc ,-)) ^'^Ak]A 

\/K ^ J/ j=l,...,d; k=l, 



,K 



thus, the constants aj > do not need to be estimated because they are already (implicitly) 
contained in the observations and, hence, in A. Therefore, we set them to 1 and we choose 



?(y) = diag{h{yi),...,h{yd)). 



(4.7) 



These assumptions now allow to directly analyze the bias term given in (4.5). Therefore, we 



need to evaluate the function /a in Theorem 4.4 However, to this end we would need to know 



Sa, i.e. we obtain from Theorem 4.4 an implicit solution (quadratic form) that can be solved 



for Sa- We set y = 1 and then obtain from Theorem |4.4| 

A-iES[V)(l)] = 5]A(l) + Af-if;E5[/A(Y(A(fe-l),AA:),YAfc,-)]j 



Note that SA(y) = ?(y)^A.'?(y), thus under (4.7) its elements are given by h{yi)h{yj)sij, i,j = 
1, . . . ,d, where we have defined AA' = (sij)jj=i,...,d. Let us first concentrate on the diagonal 
elements, i.e. i = j, and assume that time to maturity rrii corresponds to index i. 



K 



a-'(e;[V)(i)]). 



+ ?E ^s 



k=l 



f y(A(A;-l),Afc) 
\h{Y[/\{k-l),/\k + mi)) 



+ -m [h{Y{l\{k - 1), Afc + m^)f] 4 - ES [y(A(A; - 1), M)] s, 



This is a quadratic equation that can be solved for sa. Define 



A 



K 



^E5[/i(y(A(A:-l),AA: + mO)'], 



k=l 

A 



K 



l-7?E^o[>^(A(A.-l),Afc)], 



fc=i 



A 



K 



-A-Mn [%)(!)]),, + 17E^' 



fc=l 



/ y(A(fc-l),Afc) 
V/i(y(A(A:-l),Afc + mi)) 



(4.8) 

(4.9) 

(4.10) 



then we have ais\ + bsu + Cj = which provides the solution 

-b + V&^ - 4aiCi 



2a,; 



(4.11) 



Thus, the bias terms of the diagonal elements are given by 

ft, = A-i(E5[V)(l)])^^-s,„ 



which we are going to analyze below for the different maturities rrii £ Ai. For the off- diagonals 
i ^ j and the corresponding maturities rrii and rrij we obtain 

^ ^ ■" r(A(fc-l),Afc) Y{A{k-l),Ak) 



A-i(E5[V)(l)]),, = ^^, + ^T.i^o 



l^n 



2 ° 



h{Y{A{k - 1), Afc + m,)) h{Y{A{k - 1), Afc + m^)) 

f mj))/i(y(A(fc-l),Z^ 
y(A(fc--l),Afc + m,) 



fc=i 
+ -E5 [/i(y(A(/c - 1), Afc + m,))h{Y{A{k - 1), Afc + m^))] Si^s^j- 



.^(^(^-i)'^^)MnA(fc-i),Afc+m,)) 

r(A(fc-i),Afc) 



(4.12) 



r(A(fc- i),Afc + TOj) 



7i(y(A(fc- l),Afc + m,)) 
This can easily be solved for Sij for given sa and Sjj. 

5 Calibration to real data 

5.1 Calibration 

For the time-being we assume that P = P*, i.e. we assume that the market price of risk is 

identical equal to 0. This simplifies the calibration and as a consequence we can directly work 

on the observed data. The choice of the drift term will be discussed below. 

The first difficulty is the choice of the data. The reason therefore is that risk-free ZCBs do 

not exist and, thus, the risk-free yield curve needs to be estimated from data that has different 

spreads such as a credit spread, a liquidity spread, a long-term premium, etc. 

We calibrate the model to the Swiss currency CHF. For short times to maturity (below one year) 

one typically chooses either the LIBOR (London InterBank Offered Rate) or the SAR (Swiss 

Average Rate), see Jordan [3], as (almost) risk-free financial instruments. The LIBOR is the 

rate at which highly-credit banks borrow and lend money at the inter-bank market. The SAR 

is a rate determined by the Swiss National Bank at which highly-credited institutions borrow 

and lend money with securization. We display the yields of these two financial time series for 

instruments of a time to maturity of 3 months, see Figure [T] We see that the SAR yield typically 

lies below the LIBOR yield (due to securization). Therefore, we consider the SAR to be less 

risky and we choose it as approximation to a risk-free financial instrument with short time to 

maturity. 

For long times to maturity (above one year) one either chooses government bonds (of sufficiently 

highly rated countries) or swap rates. In Figure^we give the time series of the Swiss government 

bond and the CHF swap yields both for a time to maturity of 5 years. We see that the rate 

of the Swiss government bond is below the swap rate (due to lower credit risk and maybe an 

illiquidity premium coming from a high demand) and therefore we choose Swiss government 

bonds as approximation to the risk-free yield curve data for long times to maturity. 

We mention that these short terms and long terms data are not completely compatible which 

may give some difficulties in the calibration. We will also see this in the correlation matrices 

below. 



Thus, for our analysis we choose the SAR for times to maturity m S {1/52, 1/26, 1/12, 1/4} and 
the Swiss government bond for times to maturity m E {1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 15, 20, 30}. We 
choose time grid A = 1/52 (i.e. a weekly time grid) and then we calculate T^ for our observations. 
Note that we cannot directly calculate Tt,m = "i ^(i, t + m) — {m + A) Y{t — A, t + m) for all 
m € Ai because we have only a limited set of observed times to maturity. Therefore, we make 
the following interpolation: assume m + A £ {m, m] for m, m G 7W, then approximate 

fYi — (tn -\- A ) A 

Y(t- A,t + m) « ^ Y(t- A,t + m- A) + Y(t - A,t + m - A). 

m — m m — m 

In Figure ^ we give the time series of these estimated {'^t)t and in Figure E^ we give the 
component-wise ordered time series obtained from {Tt)t- We observe that the volatility is 



increasing in the time to maturity due to scaling with time to maturity. Using (4.7) we calculate 

^ ' V- ' -'■?/j = l,...,d; k=l,...,K 

for our observations. In Figures 5 and 6 we plot the time series Tt^m and [vK Ci^K)\m = 
'^t,m/h{Y{t — A, t + m)) for illustrative purposes only for maturities m = 1/52 and m = 5. We 
observe that the scaling ?(Yt^_)~^ gives more stationarity for short times to maturity, however 
in financial stress periods it substantially increases the volatility of the observations, see Figure 
[5] For longer times to maturity one might discuss or even question the scaling because it is less 
obvious whether it is needed, see Figure [6J Next figures will show that this scaling is also needed 
for longer times to maturity. We then calculate the observed matrix 

^r(^)).. , =A-i%)(i) 

as a function of the number of observations K (we set 1 = (1, . . . , 1)' E M'^). Moreover, we 



calculate the bias correction terms given in (4.8)-(4.10) where we simply replace the expected 



values on the right-hand sides by the observations. Formulas (4.11)-(4.12) then provide the 
estimates 'sij{K) for Sij as a function of the number of observations K. The bias correction term 
is estimated by 

%{K)=^^%K)-%,{K). 

We expect that for short times to maturity the bias correction term is larger due to more 
dramatic drifts. The results for selected times to maturity m G {1/52, 1/4, 1, 5, 20} are presented 
in Figures [7 11 Let us comment these figures: 



Times to maturity in the set A^i = {1/52, 1/26, 1/12} look similar to m = 1/52 (Fig- 
ure [T]); M.2 = {1/4} corresponds to Figure [SJ times to maturity in the set M.^ = 
{1,2,3,4,5,6,7,8,9,10,15} look similar to m = 1,5 (Figures [9 10); times to maturity 



m G A^4 = {20, 30} look similar to m = 20 (Figure 11 ). 



Times to maturity in A^i UTWs seem to have converged, for M2 the convergence picture is 
distorted by the last financial crisis, where volatilities relative to yields have substantially 
increased, see also Figure [5] One might ask whether during financial crisis we should 

10 



apply a different scaling (similar to regime switching models). For A44 the convergence 
picture suggest that we should probably study longer time series (or scaling should be 



done differently). Concluding, this supports the choice of the function h in (4.6). Only 
long times to maturity m € Ai^ might suggest a different scaling. 



For times to maturities in 7W3 U 7W4 we observe that the bias term given in (4.5) is 



negligible, see Figures^ 11 that is, A = 1/52 is sufficiently small for times to maturity 
m > \. For times to maturities in TWi U 7W2 it is however essential that we do a bias 
correction, see Figure [7][8| This comes from the fact that for small times to maturity the 
bias term is driven by z in /A(z,y) which then is of similar order as sa. 

In Table 1 we present the resulting estimated matrix 5]a(1) = (sij(i^))jj=i,,..,rf which is based on 
all observations in {01/2000, . . . , 05/2011}. We observe that the diagonal 'sii{K) is an increasing 
function in the time to maturity rm. Therefore, in order to further analyze this matrix, we 
normalize it as follows (as a correlation matrix) 



^ — \Hij)i,j=l,...,d 



y^MK)y/fJJ{K) ^ 



i,j='i-, 



Now all the entries pij live on the same scale and the result is presented in Figure 12 , We observe 
two different structures, one for times to maturity less than 1 year, i.e. m £ A^i = A^i U A^2i 
and one for times to maturity m E A^2 = -M3 U 7W4. The former times to maturity m G Mi 
were modeled using the observations from the SAR, the latter m £ M2 with observations from 
the Swiss government bond. This separation shows that these two data sets are not completely 
compatible which gives some ,, additional independence" (diversification) between A^i and A^2- 
If we calculate the eigenvalues of H we observe that the first 5 eigenvalues explain 95% of the 
total observed cross-sectional volatility (we have a d = 17 dimensional space). Thus, a principal 
component analysis says that we should at least choose a 5-factor model. These are more factors 
than typically stated in the literature (see Brigo-Mercurio |T], Section 4.1). The reason therefore 
is again that the short end A^i and the long end A^2 of the estimated yield curve behave more 



independently due to different choices of the data (see also Figure 12 ) . If we restrict this principal 
component analysis to M2 we find the classical result that a 3-factor model explains 95% of the 
observed cross-sectional volatility. 

In the next step we analyze the assumption of the independence of Sa(1) = AA' = (sjj)i,j=i,...,d 
from the grid size A. Similar to the analysis above we estimate Sa(1) for the grid sizes A = 
1/52, 1/26, 1/13, 1/4 (weekly, bi-weekly, 4-weekly, quarterly grid size). The first observation is 
that the bias increases with increasing A (for illustrative purposes one should compare Figure 
^with m = 1 and A = 1/52 and Figure [13| with m = 1 and A = 1/4). Of course, this is exactly 
the result expected. 

In Table 2 we give the differences between the estimated matrices Sa(1) = (sij(i^))ij=i,...,d on 
the weekly grid A = 1/52 versus the estimates on a quarterly grid A = 1/4 (relative to the 
estimated values on the quarterly grid). Of course, we can only display these differences for 
times to maturity m G M.2 U M.2 because in the latter model the times to maturity in A^i 
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do not exist. We observe rather small differences within A42 which supports the independence 
assumption from the choice of A within the Swiss government bond yields. For the SAR in 7W2 
this picture does not entirely hold true which has also to do with the fact that the model does not 
completely fit to the data, see Figure [8} Thus, we only observe larger difference for covariances 
that have a bigger difference in times to maturity compared. The pictures for A = 1/26, 1/13 
are quite similar which justify our independence choice. 

Conclusions 5.1 

We conclude that the independence assumption of Sa(1) from A is not violated by our observa- 
tions and that the bias terms f3ij(K) are negligible for maturities mi,mj G M2 and time grids 



A = 1/52,1/16,1/13, therefore we can directly work with model (4.3) to predict future yields 
for times to maturity in A^2- 

5.2 Back-testing and market price of risk 

In this subsection we back-test our model against the observations. We therefore choose a fixed- 
term annuity with nominal payments of size 1 at maturity dates m G M^. The present value of 
this annuity at time t is given by 

TTt = ^ P{t,t + m)= ^ exp{-mY{t,t + m)} ^ ^ l-mY{t,t + m) '^= nt. 

Our back-testing setup is such that we try to predict tt^ based on the observations J-t-A and then 
(one period later) we compare this forecast with the realization of vfi. In view of Conclusions 
we directly work with C/k) foi^ small time grids A (for t = A(K + 1)). Moreover, the Taylor 



5.1 



approximation ttj to vTf is used in order to avoid (time-consuming) simulations. Here a first 
order Taylor expansion is sufficient since the portfolio's variance will be - due to high positive 
correlation ~ quite large in comparison to possible second order - drift like ~ correction terms. 
Such an approximation does not work for short-long portfolios. 
For the approximation (under P*) 

Ttb,„^ « ^,(Y(t-A,t),Yi,_)|^,_^, 
we obtain an approximate forecast to vfj given by (denote the cardinality of A^3 by ^3) 
^tbt-A = d3- ^ (m + A) y(t-A,t + m) + d3 Ay(t-A,t) 

mGMs 

-\ ^'m, SP (%)(Yi,-)) - 1'm^ ?(Yi,_)qfe)W* |_^^_^, (5.1) 

where Imz — {^{i&Ms}^ • • • > ^{deMs})' ^ ^°'- Thus, the conditional distribution of vrt under P*, 
given J-t-A, is a Gaussian distribution with conditional mean and conditional variance given by 

^i*t-A = ds- J^ (m + A)y(t-A,t + m) + d3 Ay(t-A,t)-- 1^3 sp(%)(Yi,_)), 

meM:i 
^t-A = l^fa •S'(X)(Yt-) 1a43- 
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We calculate these conditional moments for t G {01/2005, . . . ,05/2011} based on the cr-fields 
^t-A generated by the data in {01/2000, . . . ,t — A}, for A = 1/52, 1/12 (weekly and monthly 
grid). From these we can calculate the observable residuals 

* _ T^t - f^t-A 

n-A 

The sequence of these observable residuals should approximately look like an i.i.d. standard 
Gaussian distributed sequence. The result for A = 1/52 is given in Figure 14 and for A = 1/12 



in Figure 15 At the first sight this sequence {Zf)t seems to fulfill these requirements, thus 



the out-of-sample back-testing provides the required results. In Figure [16] we also provide the 
Q-Q-plot for the residuals {z^)t against the standard Gaussian distribution for A = 1/52. Also 
in this plot we observe a good fit, except for the tails of the distribution. This suggests that one 
may relax the Gaussian assumption on e^ by a more heavy-tailed model (this can also be seen 



in Figure 14 where we a few outliers). We have already mentioned this in Section^ but for this 



exposition we keep the Gaussian assumption. 

If we calculate the auto-correlation for time lag A between the residuals z* we obtain 5% which 

is a convincingly small value. This supports the assumption having independent residuals. 

The same holds true if we consider the auto-correlation for time lag A between the absolute 

values \z1\ of the residuals resulting in 11%. The only observation which may contradict the 

i.i.d. assumption is that we observe slight clustering in Figure [T4j This non-stationarity might 

have to do with that we calculate the residuals under the equivalent martingale measure P*, 

however we make the observations under the real world probability measure P. If these measures 

coincide the statements are the same. 

The classical approach is that one assumes that the two probability measures are equivalent, 

i.e. P* ~ P, with density process 



^t 



t/A 
JJexp 



1 



A As + A As £Ai 



(5.2) 



with St is independent of J-t-A, -T^t-measurable and a t/A-dimensional standard Gaussian ran- 
dom vector with independent components under P. Moreover, it is assumed that At is d- 
dimensional and previsible, i.e. J^t-A-measurable. Note that this density process {^t)t is a 
strictly positive and normalized (P, F)-martingale. For any P*-integrable and J-j-measurable 
random variable X^ we have, P-a.s., 

1 



n^A [Xt 



U-A 



Et-A iCtXi 



This implies that 



(d) 



under 



t-A- 



At is called market price of risk at time t and reflects the difference between 



t-A 



and 



t-A- 



Under Model Assumptions 4.1 we then obtain under the real world probability measure 



Tt = A 



-Y(t-A,t) + ^sp(SA(Yt,_)) 



+ VA ?(Yt,_) A At + VA <j(Yt,_) A st, 
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i.e. we have a change of drift given by v A c^{Yt^-) A At. Thus, under the (conditional) real 
world probability measure Pt_A the approximate forecast ttj has a Gaussian distribution with 
conditional mean and conditional covariance given by 

fJ't-A = IH-A - ^ ^'ms ?(Yt ,-) A Ai and Tt_^ = 1^3 5(i^)(Yt _) Ims- 

For an appropriate choice of the market price of risk A^ we obtain residuals 



Zt 



Tt-A 



which should then form an i.i.d. standard Gaussian distributed sequence under the real world 

probability measure P. 

In order to detect the market price of risk term, we look at residuals for individual times to 



maturity m £ M, i.e. we replace the indicators 1x3 in (5.1) by indicators l{m}- We denote 



the resulting residuals by z^^ and the corresponding volatilities by Tm,t-A- In Figures 17, 18 



and 19 we show the results for m = 1, 5, 10. The picture is similar to Figure 14, i.e. we observe 



clustering but not a well-defined drift. This implies that we suggest to set the market price of 



risk Aj = for the prediction of future yield curves (we come back to this in Section 5.3) 



5.3 Comparison to the Vasicek model 

We compare our findings to the results in the Vasicek model [10]. The Vasicek model is the 
simplest short rate model that provides an afhne term structure for interest rates (see also 
Filipovic [3]), and hence a closed- form solution for ZCB prices. The price of the ZCB in the 
Vasicek model takes the following form 

P{t, t + m) = exp {A(m) — rt B{m)} , 

where the short rate process {rt)t evolves as an Ornstein-Uhlenbeck process under P*, and A(m) 
and B{m) are constants only depending on the time to maturity m and the model parameters 
A€*, 9* and g (see for instance (3.8) in Brigo-Mercurio [T]). The short rate rt is then under P*_^ 
normally distributed with conditional mean and conditional variance given by 

n-A[n] = n_Ae-^'^*+r(l-e-^''* 

-.2 



9 

2k* 



1 - e-2«*A 



Varj_^(rt) 

Thus, the approximation vrj has under ^l_^ a normal distribution with conditional mean 

EUN= E (l + A(m)-E;_i[r,]B(m)), 



E 

meMa 



and conditional variance 



Var:_A(^t)=Vart*_A(n) I Y. ^^ 
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As in the previous section we assume P* = P, i.e. we set the market price of risk At = 0: 
(i) this allows to estimate the model parameters k* , 0* and g, for instance, using maximum 
likelihood methods (see (3.14)-(3.16) in Brigo-Mercurio jl]); (ii) makes the model comparable 
to the calibration of our model. We will comment on this "comparability" below. 
Thus we estimate these parameters and obtain parameter estimates k* , 9* and g from which we 
get the estimated functions A{-) and -B(-). This then allows to estimate the conditional mean 
and variance of vfi, given Ft- a- From these we calculate the observable residuals 



In Figure 20 we plot the time series z^ and v^ for t € {01/2005, . . . , 05/2011}. The observation 
is that vl is far too small! The explanation for this observation lies in the assumption P* = P, 
i.e. Xt = 0. Since the Vasicek prices are calculated by conditional expectations of the entire 
future development of the short rate rt until expiry of the ZCB, the choice of the market price 
of risk At has a huge influence on the resulting ZCB price in the Vasicek model. Thus, the 
calibration oi A{-) and B{-) is completely wrong if we set At = 0. Compare 

\ogP{t,t + m) = -mY{t,t + m), (5.3) 

logP{t,t + m) = A{m) -rt B{in). (5.4) 

Conditionally, given J-t-A, we model the development from Y{t — A,t + m) to Y{t, t + m) for 



the study of (5.3). That is, we model a change of the yield curve Yt_A at time t — A to Yf 



at time t. Since the yield curve Yf_A already corresponds to market prices it already contains 



the actual market risk aversion, and thus the market price of risk Af in (5.2) only influences one 
single period in our consideration. 



The (pricing) functions A{-) and B{-) in (5.4), however, are calculated completely within the 
Vasicek model by a forward projection of rt until maturity date t + m. If this forward projection 
is done under the wrong measure P, then these pricing components completely miss the market 
risk aversion and hence are not appropriate. Thus, in general, we should have A{m) = A[m, At) 
and B{m) = B{m,Xt) which requires a detailed knowledge of the market price of risk At and, 
thus, the Vasicek model reacts much more sensitively to non- appropriately calibrated equivalent 
martingale measures P*. Note that this is true for all models where ZCB prices are entirely 
determined by the short rate process (rt)f. 

Conclusions 5.2 



We conclude that the HJM models (similar to Model Assumptions 4.1) are much more 
robust against inappropriate choices of the market price of risk compared to short rate 
models, because in the former we only need to choose the market price of risk for the one- 
step ahead for the prediction of the ZCB prices at the end of the period (i.e. from t — A to 
t) whereas for short rate models we need to choose the market price of risk appropriately 
for the entire life time of the ZCB (i.e. from t — A to t + m). 
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Our HJM model (Model Assumptions 4.1 ) always captures the actual yield curve, whereas 



this is not necessarily the case for short rate models. 

5.4 Forward projection of yield curves and arbitrage 

For the calibration of the model and for yield curve prediction we have chosen a restricted set A4 
of times to maturity. In most applied cases one has to stay within such a restricted set because 
there do not exist observations for all times to maturity. We propose that we predict future yield 
curves within these families Ai and then approximate the remaining times to maturity using a 
parametric family like the Nelson-Siegel [^ or the Svensson [U |9] family, see also Filipovic [3]. 



Finally, we demonstrate the absence of arbitrage condition given in Lemma 3.1 At the end 
of Section [T] we have emphasized the importance of the no-arbitrage property of the prediction 
model. Let us choose an asset portfolio wtP{t, t + mi) — P{t, t + m,2) for two different times to 
maturity mi and m.2- We approximate this portfolio by a Taylor expansion up to order 2 and 
set 

1 1 v(,,^ (mir(M + TOi)f\ / ^ , {ni^Y{t,t + m2)f \ 
TTt = wt 1 - ■miY{t, t + mi) ^ ^ m2Y[t, t + m2) -\ . 

Under our model assumptions, the returns of both terms miY{t, t + ?tij) in portfolio ttj have, 
conditionally given J-'t-A, a Gaussian distribution term with standard deviations given by 



^-A = V^U} V)(Y*,-)l{m,} fori = 1,2. 

If we choose wt = Tt-A-f'^t-A then the returns of the Gaussian parts of both terms in portfolio 
TTt have the same variance and, thus, under the Gaussian assumption have the same marginal 
distributions. Since the conditional expectation of the second order term in the Taylor expansion 
cancels the no-arbitrage drift term (up to a small short rate correction) we see that the returns 



of the portfolio nt should provide zero returns conditionally. In Figure 21 we give an example 
for times to maturity rrii = 10 and m.2 = 20. The correlation between the prices of these ZCBs 
is high, about 85%, i.e. their prices tend to move simultaneously. The resulting weights wt are 



in the range between 1.4 and 1.9. In Figure 21 we plot the aggregated realized gains of the 



portfolio TT minus their prognosis including and excluding the HJM correction term. Recall that 
the predicted gains should be zero conditionally on the current information. We observe that 
the model without the HJM term clearly drifts away from zero, which opens the possibility of 
arbitrage. Therefore, we insist on a prediction model that is free of arbitrage. 

A Proofs 

Proof of Theorem |4.4[ In the first step we apply the tower property for conditional expectation which decouples 
the problem into several steps. We have EJ [S'(/-f)(y)] = EJ \^a(k~i) [S'{K){y)]]- Thus, we need to calculate the 
inner conditional expectation ^a(k-i) I'] of the dx d matrix 5'(K)(y)- We define the auxiliary matrix 



\ J/j=l,...,d; k=l,...,K 
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This implies that we can rewrite C{^k) ~ K ^'^ C'(-K')- Moreover, we rewrite the matrix C^k) as follows 

C{K) ~ C'(k-i),';(Yax,-) Tak , 
with C(K-i) G K'^^^f^^i) is J^A(A'-i)-measurable. This implies the following decomposition 



S(K){y) = -^ ?(y) C(K) c[k) ?(y)' 



= -77 <f(y) [c'(k-i),'?(Yaa-,-) ^ TakJ |^C(x-i),<;(YAif,-) ^ Taa'J c(y)' 
= -^ c(y) (C(K-i) C('a-i) + (^(Yak,-)"' Yak) (<;(Yak,-)-' Tak)') ?(y)' 
5(K-i)(y) + ^ ?(y) ?(Yak,-)-' Tak T^k (?(Yak,-)-')' ?(y)'. 



A"- 1 



This implies for the conditional expectation of S(x) (y) 

El(K-i) [S(A')(y)] = ^^^ V-i)(y) + ;^ <?(y) ?(Yaa,-)-' ^Uk-d [Taa- TkA-] (?(Yaa,-)-')' ";(y)'. 

We calculate the conditional expectation in the last term, we start with the conditional covariance. From Lemma 
14. 21 we obtain 

- ?(y) ^(Yaa,-)"^ Cova(a-i) (TfAA) (c(Yaa,-)"^)' <;(y)' 



K 



§ ?(y) c(Yaa,-)'^ Ea(Yaa,-) (c(Yaa,-)'^)' <;(y)' = | SA(y). 



This implies 



ES[V)(y)] = ^^ES[5(A-i)(y)]+^SA(y) 



+ -^ ?(y) ES [?(Yaa,-)-' Ea(a-i) [Taa] El(^_i) [Taa]' (?(Yaa,-)-')'] "^(y)' 
^" ^ "S [5(A-i)(y)] + ^ EA(y) + ^ <;(y) E,* [/a(Y(A(7^ - 1), Ai^), Yaa,-)] <;(y)'. 



Iterating this provides the result 
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-SAR(3 months) LIB0RCHF(3 months} spread 



Figure 1: Yield curve time series 3 Months SAR and 3 Months CHF LIBOR from 01/2000 until 
05/2011. The spread gives the difference between these two time series. 
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- government bond (5 years) swap rates (5 years) spread 



Figure 2: Yield curve time series Swiss government bond and CHF swap rate both for time to 
maturity m = 5 years. The spread gives the difference between these two time series. 
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Figure 3: Time series T^ for t G {01/2000, . . . ,05/2011} on a weekly grid A = 1/52. 
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Figure 4: Component- wise ordered time series obtained from Tj for t € {01/2000, . . . , 05/2011}, 
i.e. Tu\„i < T(4_|_i\ ,^ for all t and m £ Ai on a weekly grid A = 1/52. 
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Figure 5: Time series Tt,m and [vK C(^x)]m = '^t,m/h{Y{t — A,t + m)) for maturity m = 1/52 
and t G {01/2000, . . . , 05/2011} on a weekly grid A = 1/52. 







Figure 6: Time series Tt^m and [s/K Ci^K^]m = '^t,m/h{Y{t — A,t + m)) for maturity m = 5 and 
t E {01/2000, . . . , 05/2011} on a weekly grid A = 1/52. 
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Figure 7: Time series 's^l^{K) and ^{{K), K = 1,...,600, for maturity mi = 1 week and 
observations in {01/2000, . . . , 05/2011} on a weekly grid A = 1/52. 
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Figure 8: Time series '^^^'^{K) and 'sii{K), K = 1, . . . ,600, for maturity rrii = 3 months and 
observations in {01/2000, . . . , 05/2011} on a weekly grid A = 1/52. 
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Figure 9: Time series s^^^^{K) and 'su^K), K = 1,...,600, for maturity nii = 1 year and 
observations in {01/2000, . . . , 05/2011} on a weekly grid A = 1/52. 
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Figure 10: Time series s'll^°'{K) and 'sa^K), K = 1,...,600, for maturity rrii = 5 years and 
observations in {01/2000, . . . , 05/2011} on a weekly grid A = 1/52. 
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Figure 11: Time series 'sf^^^(K) and 'su^K), K = 1, . . . ,600, for maturity nii 
observations in {01/2000, . . . , 05/2011} on a weekly grid A = 1/52. 



20 years and 




QJ -i^ +J -C 

CD CD C -^ 

> <D O S 

£ 5 E ? 



l/l 


i/l 


i/i 


i/l 


i/i 


i/i 


i/i 


l/l 


U1 
































m 


m 


TO 


TO 


ra 


ra 


ra 


ra 


ra 


ra 


ra 


ra 


(1) 


(1) 


CI) 


CI) 


Cl) 


Cl) 


Cl) 


Cl) 


Cl) 


Cl) 


Cl) 


Cl) 


>- 


>- 


>- 


>- 


>- 


>- 


>- 


>- 


>- 


>- 


>- 


>■ 


(N 


m 


^ 


in 


tD 


1^ 


00 


en 


o 


in 


o 


o 
m 



2 weeks 
1 week 



Figure 12: Estimated matrix S = (j)ij)ij^i^ 
on a weekly grid A = 1/52. 
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Figure 13: Time series s^'^^(K) and 'sii{K), K = 1,...,600, for maturity mi 
observations in {01/2000, . . . , 05/2011} on a monthly grid A = 1/12. 
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Figure 14: Time series of residuals z^ for t G {01/2005, . . . , 05/2011} on a weekly grid A = 1/52. 
The axis on the right-hand side displays the time series of Tt-A ■ 
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Figure 15: Time series of residuals z^ for t G {01/2005, . . . ,05/2011} on a monthly grid A 
1/12. The axis on the right-hand side displays the time series of Tt-A- 
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Figure 16: Q-Q-plot of the residuals (2^)* against the standard Gaussian distribution for A 
1/52. 
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Figure 17: Time series of residuals z*^ ^ for time to maturity m = 1 and t G 
{01/2005, . . . ,05/2011} on a weekly grid A = 1/52. The axis on the right-hand side displays 
the time series of r^ t-A- 



27 




- residuals (5 years) 0.95 confidence (-) 0.95 confidence (+) 
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Figure 18: Time series of residuals z^ ^ 
{01/2005, . . . , 05/2011} on a weekly grid A = 
the time series of Tm t-A- 



for time to maturity m = 5 and t G 
1/52. The axis on the right-hand side displays 




- residuals (10 years) 



0.95 confidence (-) 0.95 confidence (+] 



- std.dev. tau (10 years) 



Figure 19: Time series of residuals z^ ^ 
{01/2005, . . . , 05/2011} on a weekly grid A = 
the time series of Tm,t-A- 



for time to maturity m = 10 and t G 
1/52. The axis on the right-hand side displays 
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Figure 20: Time series of residuals z^ and v^ for t S {01/2005, . . . , 05/2011} on a monthly grid 
A = 1/12 under the assumption P* = P. 




-arbitrage-free 



-without HJM correction term 



Figure 21: Back testing the difference of aggregated reahzed gains of portfoho vrt for wt = 
Tt-^/Tt-A ^^^'^ ^^^ their model prognosis with and without the no-arbitrage HJM correction 
term. 
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